Document Stream Evaluation using a Sub-Topic Model
نویسندگان
چکیده
منابع مشابه
A Discriminative Topic Model using Document Network Structure
Document collections often have links between documents—citations, hyperlinks, or revisions—and which links are added is often based on topical similarity. To model these intuitions, we introduce a new topic model for documents situated within a network structure, integrating latent blocks of documents with a max-margin learning criterion for link prediction using topicand word-level features. ...
متن کاملAn Automatic Approach for Document-level Topic Model Evaluation
Topic models jointly learn topics and document-level topic distribution. Extrinsic evaluation of topic models tends to focus exclusively on topic-level evaluation, e.g. by assessing the coherence of topics. We demonstrate that there can be large discrepancies between topicand documentlevel model quality, and that basing model evaluation on topic-level analysis can be highly misleading. We propo...
متن کاملA Dynamic Topic Model for Document Segmentation
Factor language models, like Latent Semantic Analysis, represent documents as mixtures of topics, and have a variety of applications. Normally, the mixture is computed at the whole-document level, that is, the entire document contains material on several topics, without specifying where they occur in the document. In this paper, we describe a new model which computes the topic mixture estimate ...
متن کاملDiscovery of Rare Sequential Topic Patterns in Document Stream
When and Where: Predicting Human Movements Based on Social Spatial-Temporal Events Ning Yang*, Sichuan University; Xiangnan Kong, University of Illinois at Chicago; Fengjiao Wang, University of Illinois at Chicago; Philip Yu, University of Active Multitask Learning Using Both Latent and Supervised Shared Topics Ayan Acharya*, University of Texas at Austin; Raymond Mooney, University of Texas at...
متن کاملDocument Visualization using Topic Clouds
Traditionally a document is visualized by a word cloud. Recently, distributed representation methods for documents have been developed, which map a document to a set of topic embeddings. Visualizing such a representation is useful to present the semantics of a document in higher granularity; it is also challenging, as there are multiple topics, each containing multiple words. We propose to visu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Japan Society for Fuzzy Theory and Intelligent Informatics
سال: 2006
ISSN: 1881-7203,1347-7986
DOI: 10.3156/jsoft.18.280